Tftol: Using Genetic Algorithms for Attribute Grouping in Multivariate Microaggregation U Sing Genetic Aigorithms for Attribute Grouping in M Ultivariate Microaggregation
نویسندگان
چکیده
Acknowledgements Foremost, I would like to thank my daily supervisors Victor Muntés and Jordi Nin for their support and guidance during the development of this project. Without their help, this thesis would have not been possible. Also my gratitude to Josep LluÍs Larriba for giving me the opportunity to develop the project at the DAMA-UPC research group. I would also want to thank the people from the CRISES research group of the Universitat Rovira i Virgili for providing useful source code of several quality measures for protected microdata. Very special thanks to Joan Guisado, Arnau Prat and David Dominguez for making my work enjoyable during this year. Last but not least, to my parents, my brother and Laia for their love, support and encouragement.
منابع مشابه
Using genetic algorithms for attribute grouping in multivariate microaggregation
Anonymization techniques that provide k-anonymity suffer from loss of quality when the data dimensionality is high. Microaggregation techniques are not an exception. Given a set of records, attributes are grouped into non-intersecting subsets and microaggregated independently. While this improves quality by reducing the loss of information, it usually leads to the loss of the k-anonymity proper...
متن کاملRepeated Record Ordering for Constrained Size Clustering
One of the main techniques used in data mining is data clustering, which has many applications in computer science, biology, and social sciences. Constrained clustering is a type of clustering in which side information provided by the user is incorporated into current clustering algorithms. One of the well researched constrained clustering algorithms is called microaggregation. In a microaggreg...
متن کاملSemantic adaptive microaggregation of categorical microdata
In the context of Statistical Disclosure Control, microaggregation is a privacy preserving method aimed to mask sensitive microdata prior to publication. It iteratively creates clusters of, at least, k elements, and replaces them by their prototype so that they become k-indistinguishable (anonymous). This data transformation produces a loss of information with regards to the original dataset wh...
متن کاملPractical Data-Oriented Microaggregation for Statistical Disclosure Control
ÐMicroaggregation is a statistical disclosure control technique for microdata disseminated in statistical databases. Raw microdata (i.e., individual records or data vectors) are grouped into small aggregates prior to publication. Each aggregate should contain at least k data vectors to prevent disclosure of individual information, where k is a constant value preset by the data protector. No exa...
متن کاملA Polynomial Algorithm for Optimal Microaggregation
− Microaggregation is a technique that is used by statistical agencies to limit disclosure of sensitive microdata. Noting that no polynomial time algorithms are known to microaggregate optimally, Domingo-Ferrer and Mateo-Sanz have presented heuristic methods based on hierarchical clustering and genetic algorithms to identify sub-optimal solutions. We present an efficient polynomial time algorit...
متن کامل